# Noise robustness
Ultravox V0 6 Qwen 3 32b
MIT
Ultravox is a large multimodal speech language model capable of understanding and processing speech input, supporting multiple languages and noisy environments.
Audio-to-Text
Transformers Supports Multiple Languages

U
fixie-ai
1,240
0
Ichigo Llama3.1 S Instruct V0.4
Apache-2.0
A multimodal language model based on Llama-3 architecture, supporting audio and text input understanding with noise robustness and multi-turn dialogue capabilities
Text-to-Audio
Safetensors English
I
homebrewltd
486
19
Whisper Small Ita
Apache-2.0
An Italian-optimized speech recognition model based on OpenAI Whisper-small, enhanced with special tags for improved metadata capture
Speech Recognition
Transformers Supports Multiple Languages

W
litus-ai
193
8
Whisper Medium.en Fine Tuned For ATC
MIT
Fine-tuned based on the OpenAI Whisper Medium EN model, specifically optimized for speech recognition of air traffic control communications, with an 84% reduction in word error rate
Speech Recognition
Safetensors English
W
jacktol
2,525
1
Byt5 Small
Apache-2.0
ByT5 is a tokenizer-free version of Google's T5 that directly processes raw UTF-8 bytes, supporting multilingual text processing with excellent performance on noisy data.
Large Language Model Supports Multiple Languages
B
google
1.4M
69
Byt5 Base
Apache-2.0
ByT5 is a tokenizer-free version of Google's T5 that directly processes UTF-8 byte sequences, supporting multilingual text processing with robustness to noisy data.
Large Language Model Supports Multiple Languages
B
google
24.17k
22
Featured Recommended AI Models